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Abstract 

Consider the edge-deletion process in which the edges of some finite tree T are removed one 
after the other in the uniform random order. Roughly speaking, the cut-tree then describes the 
genealogy of connected components appearing in this edge-deletion process. Our main result 
shows that after a proper rescaling, the cut-tree of a critical Galton- Watson tree with finite 
variance and conditioned to have size n, converges as n ^ oo to a Brownian CRT in the weak 
sense induced by the Gromov-Prokhorov topology. This yields a multi-dimensional extension of 
a limit theorem due to Janson [13] for the number of random cuts needed to isolate the root in 
Galton- Watson trees conditioned by their sizes, and generalizes also a recent result [8] obtained 
in the special case of Cayley trees. 
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1 Introduction and main results 
1 . 1 Motivations 

Random destruction of combinatorial trees is an old topic which can be traced back more than 40 
years ago to the work of Meir and Moon [15]. Let T be a rooted tree on a finite set of vertices. 
Imagine that we pick a vertex uniformly at random and destroy it together with the entire subtree 
generated by that vertex. We iterate in an obvious way until the root is picked and are interested in 
the number N{T) of steps of this algorithm. 
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The present paper has been motivated by the following result due Janson [13] who treated the case 
when T is a large Galton- Watson tree. More precisely, consider the genealogical tree of a branching 
process having a critical reproduction law with finite variance o"^ > 0, and let 7^ be a version of this 
tree conditioned to have exactly n vertices, assuming implicitly that the probability of that event is 
positive. Then Janson established that N{Tn)/{o'y/n) converges weakly as n — )■ oo to the Rayleigh 
distribution which has density xexp(— on IR+. See also Panholzer [16] for the same result in a 
less general setting, and Abraham and Delmas [1] for a recent contribution and further references. 

The following extension has been recently obtained in [8] . Let T„ be a uniform Cayley tree with n 
vertices; it is well-known that this corresponds to a special case of conditioned Galton- Watson trees, 
namely when the reproduction law is Poisson. Given T„, distinguish k vertices uniformly at random, 
where k is some fixed integer. Then remove an edge uniformly at random and independently of 
the distinguished vertices. This disconnects T„ into two subtrees. If one of these subtrees does not 
contain any of the distinguished vertices, then we destroy it entirely, else we keep the two subtrees. 
We iterate until each and every distinguished vertex has been isolated and denote by F(T„, k) the 
number of steps. Then according to Lemma 1 in [8], Y{Tn-,k)/ \/n converges weakly as n ^ oo to 
the Chi distribution with parameter 2k, which has density 

^^a;^^-^exp(-a;V2) 

on ]R_|_. This result has also been very recently recovered by [2] using a different approach. 

The Chi(2A;) distribution occurs as the law of the length Lfc(T) of a Brownian Continuum Random 
Tree T reduced to k leaves picked uniformly at random, as it can be seen from Lemma 21 in Aldous 
[3]. The appearance the Brownian CRT in this framework should not come as a surprise since it 
is well-known that if we assign length 1/v^ to each edge of T„, then the latter converges weakly 
to a Brownian CRT T as n — )■ oo. We stress however that the rescaled Cayley tree n~^l'^Tn and 
n~^/^y(T„, k) do not converge jointly in distribution towards T and Lfc(T). 

The proof in [8] of the extension above of Janson's result relies on three crucial features. First, 
the observation due to Pitman [17] that random deletion of edges in a uniform Cayley tree yields a 
remarkable fragmentation process, second, a general limit theorem due to Haas and Miermont [12] 
for so-called branching Markov trees, and third, the characterization of the Brownian fragmentation 
in [6]. More precisely, the fragmentation process that results from the repeated deletion of edges in 
a uniform Cayley tree can be represented by a branching Markov tree whose law is known explicitly. 
In this setting, Y{%i, k) corresponds to the length of this branching Markov tree reduced to k leaves 
picked uniformly at random. Thanks to the limit theorem of Haas and Miermont, one then checks 
that this branching Markov tree with lengths rescaled by a factor 1 / i/n converges weakly, and the 
limit can then be identified as another Brownian CRT, say T', using the characterization of the 
fragmentation process at heights induced by the latter. As a consequence, n~^^^Y{Tn,k) converges 
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weakly to Lfc(T') and hence to the Chi(2A;) law. Unfortunately, this approach only works for Cayley 
trees, as for other conditioned Galton- Watson trees, random edge deletion does not yield in general 
a branching Markov tree as above, and the entire structure of the proof collapses. Nonetheless, the 
fact that Janson's result is valid for any critical Galton- Watson tree with finite variance suggests 
that the same should also hold for its natural extension to k vertices for A; > 2. 

The first purpose of this work is to show that this is indeed the case. For the sake of convenience, 
we shall deal with a slightly modified model in which we distinguish edges rather than vertices, and 
which is easily seen to have the same asymptotic behavior as the former. The precise framework and 
result are presented in Section 1.2 below. 

Our main goal, in the spirit of [8] , will be to prove a convergence result for the genealogy induced 
by the edge-deletion procedure, even though this process does not satisfy in general the Markov 
branching property of [12]. In Section 1.3, we introduce the cut-tree of a finite tree, which roughly 
speaking records the genealogy of blocks in the edge-deletion process which consists of removing 
edges of that tree one after the other and in uniform random order. In Section 1.4, we define the 
cut-tree of a Brownian CRT T, relying on a Poissonian logging process on the skeleton of T which 
has been constructed by Aldous and Pitman [4] to study the so-called standard additive coalescent. 
Our main result claims the joint weak convergence of Tn and its cut-tree suitably rescaled towards 
their continuous counterparts, namely T and cut(T); it is stated in Section 1.5. 

After this long Introduction, the rest of this paper will be organized as follows. Section 2 is devoted 
to preliminary results that will be used in the proof of Theorem 1, and the latter is established in 
Section 3. Section 4 is devoted to the proof of a technical bound, relying partly on an invariance 
property under random re-planting for Galton- Watson trees, which may be of independent interest. 

1.2 The number of cuts needed to find a few edges 

It will be convenient in the sequel to work with a slight modification of the trees under consideration. 
Consider a (rooted) tree T on a set of n vertices, say [n] = {1, . . . ,n}, we add a new vertex which 
we call the base and link to the root of T by a new edge. This gives a planted tree which we denote 
byf. 

The set E of edges of T is thus given by the set E of edges of T plus the new edge connecting the 
base to the root. We consider ^ as a set of vertices, and endow it with a natural tree structure by 
declaring that e and e' are neighbors in E if and only if they are adjacent in T. Plainly, this yields a 
tree which is isomorphic to T; more precisely the map v : E ^ [n] that associates to an edge e of T 
its extremity v{e) G [n] which is the farther away from the base vertex in T is bijective and preserves 
the tree structures. Any statement expressed in terms of the edges of T can thus be rephrased in 
terms of the vertices of T and vice versa. For a technical reason, it will be slightly simpler for us to 
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Figure 1: Planting 



work with the edge-version rather than the vertex-version of conditioned Galton- Watson trees. 

As before, we consider a critical reproduction law v with finite variance cr^ > 0. Denote by p 
the smallest common divisor of the support of v] and observe that when n is a sufficiently large 
integer, the probability that the total population of a Galton- Watson process with reproduction law 
V generated by a single ancestor is positive if and only if n — 1 e pN. 

Let %i denotes a version of a Galton- Watson tree with reproduction law v conditioned to have 
exactly n vertices, which are enumerated for instance in the breadth-first search order to yield a 
tree-structure on [n] as required. Of course the vertex corresponding to the ancestor serves as the 
root. We shall imphcitly restrict our attention to the case n — 1 e pN, so this conditioning makes 
sense provided that n is large enough. Recall that the associated planted tree is denoted by Tn and 
has n edges. 

Next for every integer A; > 1, given 7^, we distinguish k edges in Tn uniformly at random. 
Conditionally on 7^, we pick an edge uniformly at random and independently of these k distinguished 
edges. We remove it from Tn, this disconnects Tn into two subtrees. We then only consider subtrees 
that contain at least one of the distinguished edges, discarding eventually those that contain no 
distinguished edge. We iterate until each and every distinguished edge has been removed, and write 
N{Tn, k) for the number of steps. We shall prove the following. 

Proposition 1 In the notation above, 




converges in distribution as n ^ oo to the Chi distribution with parameter 2k, i.e. 



{k-l)\ 



x^^ ^ exp(-x^/2)dx. 



X > 0. 
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1.3 Cut-trees of finite trees 



We can be more accurate by keeping track of the genealogy induced by the edge-deletion process 
depicted above. More specifically, let T be a rooted tree with n vertices and T its planted version. 
Recall that T has n edges which are naturally enumerated by the map v : E ^ [n] described in the 
preceding section. 

Then we destroy entirely T by removing inductively its edges, uniformly at random one after the 
other. For j = 1, . . . ,n, we denote by ij the label of the edge that is removed at the j-th step, so 
(ii, . . . ,in) is a uniform random permutation of [n]. We partly encode this edge-deletion process by 
another tree, which we denote by cut(T) and construct as follows. For every r = 0, . . . , n — 1, let 
n(r) be the partition of E{r) := {1,2,..., n}\{ii, . . . ,ir} obtained by specifying that two elements 
j and j' in E{r) are in the same block of n(r) if and only if either j = j' or the edges with labels 
j and / are still connected in the forest obtained from T by deletion of the r first edges with labels 
ii, . . . ,ir. The family of the blocks (without repetition) of the partitions 11 (r) for r = 0, . . . , n — 1 
forms the set of internal nodes of cut(T), the initial block [n] of 11(0) being seen as the root. The 
leaves of cut(T) are given by 1, . . . , n; we stress that a singleton {i} may appear as an internal node 
of cut(T) and should not be confused with the leaf i. 

Now consider the r-th step at which the edge labeled ir is removed, and let B denote the block of 
n(r — 1) which contains ir- There are three possibilities. First, B is reduced to the singleton {v}; 
in that case we draw a single edge between the internal node B — {ir} and the leaf ir- Second, B 
is not a singleton and B\{ij.} — B' is a, block of n(r); then we draw an edge between the internal 
nodes B and B', and another edge between B and the leaf Third, there are two distinct blocks 
B' and B" of n(r) which result from B, i.e. B — B' U B" U {ir}. Then we draw an edge between 
the internal nodes B and B', a second edge between the internal nodes B and B", and a third edge 
between B and the leaf ir- If T" is a random tree, we define cut(T') by first conditioning on T and 
then performing the above construction. 

The main purpose of this work is to determine the asymptotic behavior (in distribution) of cut (7^) 
for a Galton- Watson tree Tn with size n, as n — > oo. In this direction, it is appropriate to work in 
the framework of pointed metric measure spaces. More precisely, a finite tree T with n vertices can 
be identified as {[n], dn, /in) where [n] = {1, . . . , n} is the set of vertices, dn is the graph-distance on 
[n] induced by T, and the uniform probability measure on [n]. We further retain the fact that 
T is rooted by distinguishing in [n] the root vertex (usually 1, for instance when T is a genealogical 
tree and vertices are labelled according to the breadth-first search order) . 

It will be convenient to adopt a slightly different point of view for cut(T), by focussing on leaves 
rather than internal nodes. More precisely, we set — [n] U{0} = {0, 1, . . . , n} where corresponds 
to the root [n] of cut(T) and 1, . . . , n to the leaves, and consider the (random) metric measure space 
([n]°, //„) where 5„ is the (random) graph distance on induced by cut(T), and, by a slight 
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Figure 2: The tree cut(T) of a planted tree T. The vertices are labeled in Arabic numerals in 
breadth-first order for this planar representation of T, while the order of deletion of the edges is 
indicated in lowercase Roman numerals. Each internal vertex in the tree to the right is naturally 
labeled by the set of leaves that lie in the subtree above this node. 

abuse of notation, the uniform probability measure on [n] extended by //n(0) = 0. It is easily seen 
that cut(T), and in particular its combinatorial structure, can be recovered from 

We stress that in this framework, the root of the cut-tree (which corresponds to the additional 
point in [n]°) has a crucial role. Indeed, the height (distance to the root) of the leaf i in cut(T) is 
precisely the total number of edge-removals from the successive blocks containing i until the edge i is 
finally removed. More generally, the number of internal nodes of the tree cut(T') reduced to its root 
and k leaves, say ii, . . . ,ik, coincides with the total number of edge-removals from the blocks which 
contain at least one of those k leaves until each and every of the edges with labels £j in T have been 
removed. Note also that this number differs from the total length of that reduced tree by at most k 
units. For this reason, it will be important to recall that has been singled out in the metric space 



Aldous and Pitman [4] considered a cutting process on the Brownian CRT which bears obvious 
similarities with that defined in the preceding section for finite trees. We recall and develop some 
features in this setting that will be useful for our purpose. 




1.4 



Cut-tree of a Brownian CRT 
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Let us first recall some basic facts about topologies on metric measure spaces that we will need. 
A pointed metric measure space is a quadruple (X, d, fi, x) where {X, d) is a complete metric space, 
X & X and is a Borel probability measure on [X, d) . 

Two such spaces {X,d,ii,x) and {X' ,d' , /i' ,x') are called isometry-equivalent if there exists an 
isometry / : supp(;u) U {x} — )■ X' (here supp is the topological support) such that f{x) — x' and the 
image of // by / is /i'. This defines an equivalence relation between pointed metric measure spaces, 
and we note that the representatives {X, d, /i, x) of a given isometry-equivalence class can always be 
assumed to have supp(/x) U {x} — X. 

The set M of (isometry-equivalence classes of) pointed metric measure spaces is a Polish space 
when endowed with the so-called Gromov-Prokhorov topology. Gromov's book [10] and the paper [9] 
are good references, although they deal with non-pointed spaces, which differs from our setting only 
in a minor way. Recall that a sequence (X„, d„, Xn) of pointed measure metric spaces converges in 
the Gromov-Prokhorov sense to (Xqo, d^o, ^oo, ^oo) if and only if the following holds: For n G NU{oo}, 
set ^n(O) = Xn and let ,^„(1), ^„(2), ... be a sequence of i.i.d. random variables with law then the 
vector idniU{i),^n{j)),0 <i,j < k) converges in distribution to (cioo(Coo(0> Coo(j))> <hj < k) for 
every k>l. 

Now recall that T denotes a Brownian CRT. It is endowed with the uniform probability "mass" 
measure // and the usual distance d, and also comes with a distinguished point called the root [3]. 
Therefore, T is viewed as a random variable in M. Note that the root plays the same role as a 
//-randomly chosen point in T, which is usually called the invariance property of T under re-rooting. 

The distance d induces an extra length measure A, which is the unique cr-finite measure assigning 
measure d{x,y) to the geodesic path between x and y in T. Roughly speaking, the probability 
measure /i is carried by the subset of leaves of T while the length measure A rather lives on the 
skeleton, i.e. the complementary of the set of leaves. 

Conditionally on T, we introduce the family {ti, Xi)i^i of the atoms of a Poisson random measure 
with intensity di dA, where / is a countable index set. We view these atoms as marks that are 
deposited along the skeleton of T as time grows. Let T{t) be the "forest" obtained by removing the 
points {xi i & I ,ti < t} that are marked before time t. For every x e T we let /ixit) be the //-mass 
of the component of T{t) that contains x, where by convention we let /ixit) — H x — Xi for some 
i & I with ti < t. 

Aldous and Pitman [4] have observed that if ^ denotes a random point in T distributed according 
to II and independent of the Poisson point process of marks on the skeleton, then the processes 

{pL^{t),t > 0) and (1/(1 + a{t)),t > 0) have the same law, (1) 

where {a{t),t > 0) denotes the first-passage time process of a linear Brownian motion. Specializing 
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results of [5] in this setting, we easily deduce that if we define 

roo 

hx^ IJ'x{t)dt, X eT , 
Jo 

then 

has the Rayleigh distribution; (2) 

see Section 3.2 below for details. As a consequence < hx < oo a.s. for //-almost every x. 

We next add to T an extra point, denoted for simphcity by 0, and write T*^ = T U {0}; serves 
of course as distinguished element of T°. We define a (random) function S of two arguments in T° 
by setting 

poo 

S{0, 0) = , S{0, x) = S{x, 0)^hx, and S{x, y) = / + fXy{t))dt , 

where for x, y e T with x ^ txy denotes the (a.s. finite) smallest time t when x and y belong to two 
distinct components of T(i). Note that txy is the first time where a mark appears on the geodesic 
from X to y, and as such it has an exponential distribution of parameter d{x,y). Observe also that 
Ijl{{x e T : S{0,x) — 0}) = a.s. since > a.s. and that for every y G T, //({x G T : 5{x,y) — 
0}) = a.s. since //^(i) > for all i > 0, a.s. 

Let ^(0) = and > 1) be an i.i.d. sequence with law fj, conditionally given T. We will see 

in Lemma 4 below that the two random semi-infinite matrices 

(di^i + l),e(j + 1)) ■■i,j> 0) and m(i),m) ■■hj> 0) (3) 

have same distribution^. In particular, S IS cl.S. db distance on the set > 0}, and TZ{k) = 

{{^{i),0 < i < k},6) can be understood as a consistent family of random rooted trees with respec- 
tively k leaves in the sense of Aldous [3] — here, the spaces TZ{k) have k + 1 elements while they 
should really be "trees with edge-lengths" in the context of Aldous' paper, but this is only a minor 
difference that does not affect our discussion. Since T satisfies the so-called leaf-tight property 

infrf(e(l),C(0)=0, a.s., 

the family {TZ{k),k > 1) also satisfies this property with probability 1, even conditionally given 
T and the Poisson cuts {ti,Xi)i^i. By Theorem 3 in [3], this shows that {TZ{k),k > 1) admits a 
representation as a continuum random tree, that we call cut(T). This means that, still given T and 
the process of Poisson marks, cut(T) is a pointed metric measure space, with underlying distance 

^The shift of indices in the left-hand side comes from the fact that the distinguished point is not the root of T, 
and formally it is not even an element of the latter 
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function r. root Xq and probability measure m, and that if Xi, X2, ... is an infinite i.i.d. sequence with 
distribution m, then the matrix {r{xi,Xj) : i,j > 0) has same distribution as (5(6) 0) '■ hj ^ 0). 
Up to performing this "resamphng", we thus see that '■ i,j > 0) can itself be seen as 

the matrix of mutual distances between the points of an i.i.d. sample of cut(T). In the previous 
discussion, we insisted on conditioning first on T and (tj to underline the fact that the 

random elements T, (ti,Xi)ifzi and cut(T) arc defined on a common probability space. Of course, the 
equality in distribution (3) entails that unconditionally, the random variable cut(T) has the same 
distribution as T. 

Let us make a remark at this point. The reader might consider more natural to define cut(T) in a 
more "concrete" way, by first taking the quotient space of (T°, S) by the relation {{x, y) : 5{x, y) = 0}, 
and then taking a metric completion. This operation comes along with a natural mapping that first 
projects from T° onto the quotient space, and then injects in the completion. Therefore, we could 
endow the space with the image measure of fi by this natural mapping. However, several measurability 
issues appear here: First, this mapping should be measurable in order that this construction makes 
sense, and second, it should be checked that the law of the resulting random metric measure space is 
indeed a measurable function of the tree T. These issues do not appear in our context because Aldous' 
construction defines cut(T) only in terms of the sequence of random variables ^(j)) '■ i,j > 0). 

We believe that these issues can be overcome, however we are not going to consider them here to 
keep this work to a reasonable size. 

1.5 Main result 

If X = {X, d, /I, x) is a pointed metric measure space and a > 0, we let aX = {X, ad, /i, x) be the 
same space with distances rescaled by the factor a. 

Theorem 1 As n ^ 00, we have the following joint convergence in distribution inMx M, endowed 
with the product topology (M having the Gromov-Prokhorov topology): 



Moreover, cut(T) has same distribution as T. 

It may also be convenient, e.g. for readers who would not feel at case with weak convergence in 
the sense induced by the Gromov-Prokhorov topology, to rephrase the first part of Theorem 1 as 
follows (and this is actually what we shall prove). For every n G N, set ^n{^) = and consider a 
sequence {^n{i))i>i of i.i.d. variables having the uniform distribution on [n]. Let also ,^(0) = and 
given a CRT T, let {^{i))i>i be a sequence of i.i.d. variables in T distributed according to the mass 
measure In the notation of Section 1.3 and L4, we have: 
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Theorem 2 As n — > oo, the following two weak convergences hold jointly in the sense of finite- 
dimensional distributions: 



n 



and 

1 



We conclude this Introduction with three remarks. First, it follows from Theorem 1 that 

^ cut(7;)=^T. 



a^/n 



In particular, the total length Lfe(cut(7^)) of the cut-trcc of Tn reduced to k leaves chosen uniformly 
at random, converges after renormalization by a factor {a^/n)^^ to the total length of a Brownian 
CRT reduced to k i.i.d. leaves picked according to the mass measure /i. Since the latter is known 
to have the Chi(2fc)-distribution and Lk{cut(7'n)) only differs from the number N{Tn,k) of edge- 
deletions which are needed to recover k distinguished edges picked uniformly at random in 7^ by at 
most k units, we thus see that Proposition 1 follows from Theorem 1. 

Second, we do not know whether the weak convergence stated in the theorem holds for stronger 
topologies of the Gromov-Hausdorff type, even though this is indeed the case when we only consider 
the first component. 

Third, in order to ease the presentation, it has been convenient to work with trees on a set of 
labelled vertices, namely [n]. This induces a structure which is not relevant for our main results, as 
these could be stated in terms of graph-theoretic trees over a finite set of vertices considered up to 
graph isomorphisms. The labeling of the vertices comes naturally when considering Galton- Watson 
trees; however it could be ignored after the object is sampled. 



2 Some preliminary results 

The proof of Theorem 1 is rather long and rehes on several intermediate results. 
2.1 A modified distance on cut-trees 

In this section n is fixed; we consider an arbitrary tree T on [n] and write as usual T for its planted 
version with n edges. We shall define a modified distance on its cut-tree, which is both close to the 
rescaled initial distance on cut(T') and resembles the distance defined in Section 1.4 for the Brownian 
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CRT. 

Imagine that we mark each edge e E T with rate l/\/n, independently of the other edges. In 
particular the first mark is assigned after an exponentially distributed time with mean l/\/n, and 
the edge which is first marked is independent of that time and has the uniform distribution on 
the set E of the n edges of T (recall that the set of edges of the planted tree T can be canonically 
identified with the set [n\ of vertices of T, so that by a slight abuse, jiT denotes indistinctively the 
uniform probability measure on the set of vertices of T or on the set of edges of T). If we remove 
the edge e at the instant when it is marked, then wc obtain a continuous time version of edge- 
deletion process described in Section 1.3. We denote by 5t the cut-distance on the set of vertices 
[n]° = [n] U {0} which has been defined in that Section. Recall that in this setting, should be 
thought of as the root, 1, . . . , n as leaves, viewed as the edges of T, and then 5t(0, i) is given by the 
number of edge-removals that are performed on the successive blocks containing i until i is finally 
removed. 

For every t > 0, we write T(t) for the random forest that results from the edge deletion process 
at time t, and for every i G [n], Ti{t) for the tree-component of T{t) which contains the edge labelled 
by i, agreeing of course that Tj(t) is empty whenever i has been removed before time t. We also 
set [JiT,i{t) = fJ^T(Ti(t)); this quantity gives the number of edges of Ti{t) normalized by a factor 
1/n. Mimicking the construction of the cut-distance on the Brownian CRT in Section 1.4, we now 
introduce 

5^(0, 0) = , S'j,{Q, i) = S'^ii, 0) = / i^T,iit)dt , i e [n] 

Jo 

and 

poo 

^'Ahj) = / (/^r,i(i) +//T,j(i))di, i,j e [n] 
Jtij 

where tij is the first instant t at which the edges i and j become disconnected in T{t). 

We have thus endowed with two distances, St and Sip related to the edge-deletion process; 
and our purpose here is to observe that -^St and S'rp are close when n is large. Here is a precise 
statement. 



Lemma 1 For every i & [n], we have 

1 



E 



n 



STiO,i)-S'TiO,t] 



n 



E(S'AO,i)) 



and, as a consequence, for every j G [n], we have also 



E 



n 



<^E{S't{0,z) + S't{0,j)). 
V ri 
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Proof: Plainly, it suffices to establish the first equality as the upper-bound then follows from the 
triangle inequality. 

Denote by Nj^{t) the number of edges that have been removed up to time t from the tree- 
components that contain the edge i; in particular 

\imN^{t)^ST{0,i). 

t-^oo 

Since each edge of T{t) is removed with rate independently of the other edges, the process 

M{t) = -^N^{t) - [ fiTAs)ds , t>0 
Jo 

is a purely discontinuous martingale with terminal value 

1 ■ f°° 1 

lim M(t) = —N'(oo) - / i^Tiis)ds = — 5t(0,«) - 6'J0,t) . 

Further, its quadratic variation is [M]t = n~^N^{t) and thus its oblique bracket is given by 

1 /■* 
V ^ ^0 



As a consequence, we have 

1 



E 



5t(0,z)-5^(0,z) 
n 



n 



which is our statement. □ 
2.2 Joint convergence of the subtree sizes 

Recall that Tn denotes a Galton- Watson tree corresponding to critical reproduction law with finite 
variance > and conditioned to have size n. We know from Aldous [3] that an~^^'^Tn converges 
in distribution to a Brownian CRT. Motivated by Lemma 1, the purpose of this section is to point 
out that this convergence can be reinforced to hold jointly with that of the rescaled sizes of sub-trees 
appearing in the edge-deletion processes. 

Before providing a rigorous statement, we need to introduce some further notation. For n fixed, 
given the random tree Tn, we consider a sequence > 1) of i.i.d. random variables in [n], each 

having the uniform distribution yU„ on [n]. We stress that the should be viewed as random edges 
of the planted tree Tn, although in this section it will sometimes be convenient to think of the latter 
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as vertices of 7^. Marking randomly each edge of 7^ at rate ^/^/n as in Section 2.1, we denote for 
every t > by IJ-n,£,„{i){t) the number of edges of the tree-component containing the edge ^n(0 i'^ 
forest at time t, Tnit), and normahzed by a factor 1/n. We also denote by Tn{i,j) the first instant 
when the edges ^n(i) and are disconnected in the forest Tn{t)- 

Next, we consider the Brownian CRT T together with the Poisson point process of marks on 
its skeleton as in Section 1.4, and an independent sequence > 1) of i.i.d. random variables 

in T distributed according to the uniform measure fj,. Recall that for every t > 0, fJ-^{i){t) denotes 
the /i-mass of the tree-component containing ^{i) in forest T(t) that results from T by cutting its 
skeleton at marks which appeared before time t. Finally, we denote by T{i,j) = the first 

instant when the points ^(i) and ^{j) are disconnected, i.e. the first time when a mark is put on the 
segment joining ^(i) and ^(j) in T. 

We are now able to state: 
Lemma 2 As n ^ oo, we have the following weak convergences 




{rn{i,j) : i,i e N) ^ (T(i,j) : i,j e N) 

and 

{l^n,^n{i){t) ■t>0 and i e N) =^ (/^ew(^) : ^ > and i e N) 

where the three hold jointly; the first in the sense induced by the Gromov-Prokhorov topology, and the 
second and third in the sense of finite- dimensional distributions. 

Proof: The proof follows closely arguments developed by Aldous and Pitman [4] in a similar setting; 
see Section 2.3 there. 

We first consider the edge-deletion process on %. rather than on its planted version %, and view 
the random variables (Cn(^), ^ > 1) as a sequence of vertices of 7^ rather than edges of Let 7l{n, k) 
denote the subtree of %i spanned by the first k random vertices {inij)-i 1 < ^ < ^} and the root of 
Tn- Similarly, we denote by TZ{oo, k) the subtree obtained from the CRT T by reduction to its root 
and the first k i.i.d. variables {^(^), 1 < i < A;} with common distribution the mass-measure // on 
T. Here, we adopt the framework of Aldous [3], viewing the reduced trees as a combinatorial rooted 
tree structure with edge lengths and leaves labeled by 1, . . . , /c. As it was already stressed, we know 
from the work of Aldous [3] that there is the convergence 

-^Tn ^ T , (4) 
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and this can be rephrased in terms of reduced trees as 

-^n{n,k) ^n{oo,k), (5) 

where -^TZ{n, k) has the same tree-structure as 7^(n, A;) but with edge lengths rescaled by a factor 
a/y/n. We shall now see how (4) can be enriched to encompass the further convergences in the 
statement. 

Next, we write {TZ{n, k,t) : t > 0) for the reduced tree TZ{n, k) endowed with a point process of 
marks on its edges. More precisely, each edge receives a mark at its midpoint precisely at the time 
when this edge is removed as in Section 2.1. Similarly, we denote by {TZ{oo, k,t) : t > 0) for the 
reduced tree Tl{oo, k) endowed with a Poisson point process of marks on its skeleton with intensity 
dt®dA, where by slightly abusive notation, A is now the length measure on the reduced tree 7^(oo, k). 
It is then easy to extend (5) to 

(^-^n{n, k,t) :t>0^ ^ (7^(oo, k,t/a) : t > 0) (6) 

on the space of rooted trees with k leaves and edge-lengths, endowed with an increasing process of 
marked points; this space being equipped with the appropriate topology. More precisely, the time- 
rescaling with a factor l/cr in the right-hand side stems from the fact that the edges in 7l{n, k) have 
been rescaled by a/\/n. 

Then, for every i > 1, denote by ri{n,k,i,t) the number of vertices among ,^„(1), . . . , ,^„(A;) in 
the tree-component containing the vertex ^nii) which results from Tl{n, k) by cutting at marks that 
appeared before time t. Denote also by T^{i,j) the first instant when a mark appears on the segment 
in TZ{n, k) connecting and 

Similarly, let rj^oo, k,i,t) be the number of vertices among ^(1), . . . ,C,{k) in the tree-component 
containing the vertex ^(i) which results from TZ{oo, k) by cutting at marks that appeared before time 
t. It follows (6) that 

(r/(n, k, i,t) ■.t>Q and i e N) =^ {r]{oo, k, i, t/a) : t > and i e N) 

and 

KiiJ) : i,i e N) ^ (ariij) : ij e N) (7) 

in the sense of finite dimensional distributions. More precisely, these convergences hold jointly, also 
together with (4). 
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The law of large numbers entails that for each fixed i and t >0, 

lim k~^r]{oo,k,i,t/a) = ii^u\{t/a) , almost surely. 

k-^oo 

We deduce that for every fixed integer i and times < ti < . . . < te, we can construct a sequence 
A;„ — >■ oo sufficiently slowly, such that 

{k~^r](n,kn,i,tj) : 1 < i,j < £) ^ {ii^^i)(tj/a) : 1 < i,j < £) , 

and again this convergence holds jointly with (4) and (7). 

This is essentially the sought-after result; the only minor difference being that we viewed the ^n(i) 
as vertices of %. rather than edges of the planted tree However it is easy to check that with 
high probability, this makes no difference when n is large. Indeed, we realize that on the event that 
the edges Cn{j) of Tn have not been marked before time t, which has probability greater 

than 2exp{—t/y/n) — 1 — >■ 1 as n — >■ oo, r]{n,k,i,t) differs from kiin,^„(i){t) by at most one unit 
(recall that a tree with j vertices has j — 1 edges). Similarly, on the event that the edges ^n{i) and 
^n{j) have not been removed when the segment connecting C.ni'i) and ^n(j) receives its first mark, 
which has probability E (^^(^^(i), ,^„(j))/(2 + dn{^n{i) , ^n{j))) ^ 1 as n — >■ oo, there is the identity 
'Tnihj) — 'T'nihj)- This Completes the proof. □ 



2.3 A uniform bound 

The next technical step in the proof of Theorem 1 is the obtention of a uniform bound for the first 
moment of the size of a 'typical' tree component occuring in the edge-deletion process for Galton- 
Watson trees. Specifically, recall that = 'Cn(l) is a random edge of Tn picked according to the 
uniform probability measure yU„ and independently of the edge-deletion process, and that 
denotes the number of edges in the tree component of Tn{t) which contains the random edge and 
rescaled by a factor 1/n. We claim the following: 

Lemma 3 There exists some finite constant C > depending only on the offspring distribution v 
such that 

n{l — exjp{—t/ ^^/n))^ 

for allt>0 and neN. 

We stress that this bound is only relevant when t is not too small, since the left-hand side is always 
less than or equal to 1. 

The proof of Lemma 3 relies crucially on an estimate due to Janson [13] and an invariance property 
under random re-rooting for planted Gallon- Watson trees. It is convenient to postpone its proof to 
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Section 4; wc merely conclude this section with a consequence of that lemma which will be used in 
the proof of Theorem 1 . 



Corollary 1 There exists a finite constant C > depending only on the offspring distribution u 
such that 

E(5;(en,0)) <C" for all n en, 
where S!^ denotes the modified distance on cut{Tn) defined in Section 2.1. 

Proof: An apphcation of Lemma 3 at the second hne below gives 



/•oo 

EK(e„,0)) = / E(/.„,^„(i))di 



A n(l-exp(-VV^))2 

C 

^/n{l - exp(-l/v^)) ' 

and this last quantity remains indeed bounded as n — )■ oo. □ 



3 Proof of Theorem 1 



3.1 The cut-tree of a Brownian CRT is another Brownian CRT 

In this section, we complete the construction of cut(T) that was performed in Section 1.4, and show 
that it has the same distribution as the Brownian CRT. Both will follow from the following lemma. 

Lemma 4 Set ,^(0) = and let {^{i) : i G N) denote a sequence of i.i.d. points in T distributed 
according to the uniform probability measure fi. Then there is the identity in law 

{dm + 1),C(J + 1)) : ^,3 > 0) ^=\sm,m) ■■ hJ > 0) . (8) 

As a warmup, we first provide a short proof of the one-dimensional identity in (8), i.e. for i and 
j fixed. By a well-known property of invariance in law of the Brownian CRT under random uniform 
re-rooting, it suffices to treat the case i = and j = 1, and we thus consider the cut-distance 5(0, ^) 
of a random point ^ with law /j, to the root {(0) ~ 0. In the notation of Section 1.4, we have 

/■oo 

Jo 



16 



Applying the identity in distribution (1), we see that this variable has the same law as the Cauchy 
transform 

where {a{t),t > 0) is the stable(l/2) subordinator given by the first-passage time process of a 
standard Brownian motion, i.e. with Laplace exponent $(r) = \/2r. 

According to Corollary 3 and Lemma 3 of [5] specified to subordinators, we have 

P(C((t) < t) = 1 - exp(-7(t)) , t>Q, 

where 7 denote the inverse of the function t — >■ $(r)~-'^dr. For $(r) = -\/2r, we get ^{t) — 
and conclude that the distribution function of C((7) is i — > 1 — exp(— which is the distribution 
function of the Rayleigh law. The latter coincides with the distribution of the height d(0, ^) of a 
point picked uniformly at random in T, and we conclude that indeed (8) holds in the weaker sense 
of one-dimensional distributions. We note passing by that the claim (2) is now estabfished. 

Unfortunately such direct calculations are not available for multidimensional distributions, and 
we shall use a different approach which relies on a general features of self-similar fragmentations. We 
thus start by developing elements in this area and refer the reader to [7] and in particular Chapters 
2 and 3 there for background. 

For this purpose, it is convenient to work in the setting of processes with values in the space of 
partitions of N = {1, 2, . . .}, which arise naturally from i.i.d. sampling. Given T and a sequence 
{^{i) : i e N) of i.i.d. points with law /x, we shall consider two such fragmentation processes. A first 
fragmentation process F = (F(i),i > 0) results from cutting the CTR at its heights. Specifically, 
recall that denotes the root of T. For every x, y e T, let [x, y\ be the segment connecting x and y, 
and define the branch-point xAy as the unique point in T such that [0, x] fl [0, y] — [0, xAy] . Then we 
declare that two distinct integers i j belong to the same block of the partition F(i) if and only if the 
height of the branch-point of ^{i) and ^(j) is greater than t, i.e. d{0, ^{i) A^(j)) > t. In other words, 
the height of the branch-point is the first time at which i and j are disconnected in the fragmentation 
process F. We stress that {i} is a singleton of r(t) whenever the height of ^(i) is smaller than or 
equal to t; in particular F(i) is the partition into singletons whenever t > sup{o?(0, x) : x e T}. 

Recall that {ti, Xiji^i denotes the family of the atoms of a Poisson random measure with intensity 
dt ® dA on the skeleton of T, which is assumed to be independent of the preceding. We denote by 
n = (Jl{t),t > 0) the Aldous-Pitman fragmentation of the Brownian CRT, obtained by declaring 
that two integers i and j belong to the same block of Il{t) if and only if CO)] ^ {^i '■ U ^t} — 0, 
that is if and only if ^(i) and ^{j) belong to the same component of the random forest T{t). 

These two fragmentation processes are related by a sort of time-substitution which is the key to 
Lemma 4. For every i e N and t > 0, denote by Bi{t) the block of Il{t) which contains i and by 
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Bi{t)\ its asymptotic frequency; it is also convenient to agree that Sj(oo) = {i}. Next define 



/ 

^0 



■u 



Pi{t) = inf <^ K > : 



Bi{r)\dr > t 



t > 



with the usual convention inf = oo. Roughly speaking, we use the pi to time-change the fragmen- 
tation n and write n'(t) for the partition whose family of blocks is given by the Bi{pi{t)) for i e N 
(observe that two such blocks are either disjoint or equal). 

Lemma 5 In the notation above, the fragmentation processes T and U' have the same law. 

Proof: The Aldous-Pitman fragmentation 11 is a self-similar fragmentation with index a — 1/2, 
erosion coefficient 0, and dislocation measure denoted here by A, as it is seen from see e.g. Theorem 
3 in [4] and Theorem 5.4 in [7]. According to Theorem 3.3 in [7], the time-changed fragmentation 
n' = (n.{p{t)),t > 0) is then a self-similar fragmentation, now with index a — 1 — —1/2, with no 
erosion and the same dislocation measure A. 

On the other hand, the discussion on pages 339-340 in [6] and the well-known construction of 
the Brownian CRT from twice the normalized Brownian excursion (see for instance Corollary 22 in 
[3]) show that F is again a self-similar fragmentation with index —1/2, no erosion and dislocation 
measure A. Hence F and 11' are two self-similar fragmentations with the same characteristics, they 
thus have the same law; see [7] on page 150. □ 

Lemma 4 should now be obvious. Indeed, by the law of large numbers, the /x-mass of a component 
of T{t) can be recovered as the asymptotic frequency of the corresponding block of the partition, 
and in particular \Bi{t)\ — //^(j)(i). Recall that the height d{0,^{i)) of ^(i) in T can be viewed as 
the first instant t when {i} is a singleton of F(i), a quantity which, in terms of the Aldous-Pitman 
fragmentation 11, corresponds to 



where r(i,j) denotes the first instant t when a mark appears on the segment Combining 
these observations with Lemma 5, we conclude that (8) holds. 




Similarly, for i, j e N with i j, 



d{o, m A m = d{o, m + d{o, e(j)) - , 



and in terms of 11, this quantity corresponds to 
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3.2 Proof of weak convergence 



It is convenient to establish first the convergence in Theorem 1 when cut (7^) is endowed with the 
modified distance as defined in Section 2.1. We write cut'(7^) = ([n]", 5^, 0) for the pointed 
metric measure space equipped with the modified distance and claim: 



Lemma 6 As n — )■ oo, there is the joint convergence in the weak sense induced by the Gromov- 
Prokhorov topology: 



a 



Tn-, cut'(7;^) 



(T,cTcut(T)) . 



Proof: We use the setting and notation of Section 2.2 and derive from Lemma 2 that for every 
fixed integer 



T =^ T 

In ^ ' ) 



and 



2"'E/^-.€"WO'2"0 : ^ e N ^ 2"^ J] /.^(,)(j2-7a) : z G N 

where the two convergences hold jointly; the first in the sense induced by the Gromov-Prokhorov 
topology and the second in the sense of finite- dimensional distributions. 

For every non-increasing function / : [0, oo) [0, 1], there is the bound 



a\ /(t)di-2-^E/0-2-V<7) 



< a I 2-^ + / /(t)dt 



Since the Rayleigh distribution has a finite mean, we deduce from (2) that 



E 



a'- 

j'OO ^ 

a A^^(,)(t)dt-2-^5^/.5(,)(j2-7a) 



as £ ^ OO , 



where of course the left-hand side does not depend on i. 

Similarly, using now Lemma 3, we obtain the uniform bound 



E 



At 

/"OO 

I ^n,Ui) (^)d^ - 2"^ Y] ^n,Ui) ^'2 
^0 ,=1 



< 2-^ + C 



2-^ + 



poo 

J 2^ n{l 



exp{-t/y/ri) 



dt 



C 



V^(l - exp(-2VV^)) ' 
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We thus see that the left-hand side above also tends to as ^ — >■ oo, uniformly in n e N (and again 
these quantities do not depend on i). 

RecaUing that 

poo poo 

<^n(0,CnW)=/ ^in,^^(^){t)^t and 5{0,Cii))^ II^^i^{t)dt, 
Jo Jo 

we conclude that 

(5;(0,en(^)) : z e N) ^ {c75{0,m : ^ e N) 

in the sense of finite-dimensional distributions, and the latter holds jointly with -^Tn T. Essen- 
tially the same argument, using now further the convergence of disconnection times stated in Lemma 
2, shows that the preceding also hold jointly with 

This is precisely the meaning of our statement, since we have seen in Section 1.4 that the doubly- 
infinite sequence : i,j > 0) can be seen as the matrix of mutual distances between the 
root of cut(T) and an i.i.d. sample of points in cut(T). □ 

This immediately entails the convergence stated in Theorem 1. Specifically, recall that ^„(0) = 0; 
combining Lemma 1 and Corollary 1 we get that for all i, j > there is the upper-bound 



E 



-^S^{Ui):Uj))-S'n{U^:Uj)) 



^ AC 



Therefore Lemma 6 can be rephrased as 



Tn, -^CntiTn) ) =^ (T,(7CUt(T)) , 

n \/Ti 



which is the claimed convergence. 



4 Proof of Lemma 3 

The purpose of this final section is to establish Lemma 3. The proof relies on an estimate due to 
Janson [13], combined with an invariance property of the law of Galton- Watson trees under random 
re-planting. We start by the latter; this is the main part of this paper where working with planted 
trees makes the approach simpler. 

It will be convenient in this section to work in the setting of planar rooted trees rather than 



20 



tree structures on a set of labelled vertices. As vertices of planar rooted trees can be canonically 
enumerated, for instance in the breadth first search order, the transformations appearing in this 
section could also be re-phrased in terms of tree structures on a set of labelled vertices, though doing 
so would make the descriptions more involved. 

A Galton- Watson tree T is thus viewed here as a random planar rooted tree. We write V{T) for 
the set of vertices of T; in particular its cardinal |T^(7^| is the total number of individuals in the 
branching process with critical reproduction law u and whose genealogy is represented by T. Recall 
also that T denotes the planted version of T and then |1^(T)| is the number of edges of T. 

A pointed tree is a pair (T, v) where T is a planted planar tree and v a vertex distinct from the 
base, i.e. a vertex of T. We endow the space of pointed trees with a sigma-finite measure GW* 
defined by 

where T denotes a generic planar planted tree and v e V{T). This measure is a classical object 
appearing in particular in the approach by Lyons, Pemantle and Peres, see [14]. 

We now describe a transformation of pointed trees which will be used in the proof of Lemma 3. 
If (T, v) is a pointed tree, we let T„ be the (non-planted) tree formed by all the descendents of v in 
T, including v, and T"" be the subtree obtained by removing all the strict descendents of v in T. We 
first re- plant T'" ai v, viewing the edge connecting v to its parent in T'" as the new base-edge. We 
denote the former base- vertex by v and the new planted tree by T^. Finally we re-graft at v and 
get another pointed tree which we denote by {T,v); see Figure 3 above. 

We stress that the set of vertices of T and of T coincide. More precisely, if we remove the strict 
descendents of -u in T then we get T" (so our notation is coherent), while the subtree formed by 
the descendents of {) in T coincides with T^, i.e. To = Ty. We also note that the transformation of 
pointed trees (T, v) (T, v) is involutive, i.e. its iteration is the identity. 

Proposition 2 The "laws" of {T ^v) and {T,v) under the measure GW* are the same. Equivalently, 
(T'',T„) and {T^,Ty) have same "law" under GW^. 

Proof: We only sketch the proof of this proposition, leaving some technical details to the reader. 
From Lyons et al. [14], we know that under GW*, a typical pointed tree {T,v) can be described in 
terms of Kesten's critical Galton- Watson tree conditioned on non-extinction, or size-biased tree. We 
start by recalling some features concerning the latter. 

The size-biased tree is a random planted tree % with a single infinite branch vo,vi, . . . starting from 
the base vq, that can be constructed as follows. As for a standard Galton- Watson tree, every vertex in 
% has an offspring that is distributed according to the reproduction law v and independently of the 
other vertices, except of course the base, Vq, which has only one child, and the further vertices of the 
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Figure 3: The trees Ty, T'", T'", with the distinguished vertices v and v, with the last picture explaining 
how to construct T from T'" and T„ 

infinite branch, vi,V2, ■ ■ ■, whose offspring distribution is the size-biased measure i/^lk) — ki'{k) (recall 
that the reproduction law v is critical). The size- biased tree T* is constructed inductively starting 
from the base Vq by claiming that the i-th vertex Vi on the infinite branch is chosen uniformly at 
random from the offspring of vi^i. 

For every /i > 1, let T^^ be the planted tree pointed at Vh which is obtained from % by re- 
moving all the strict descendents of Vh, and let GW^ be its law. Now consider an integer h > 1 
"sampled" according to the counting measure on N. Sample the pointed tree {T^^,Vh) as above, 
and independently choose a (non-planted) Galton- Watson tree T'. Next graft the root of T' at the 
pointed vertex Vh of T^*" to form a planted tree, which we denote by T". Then [T" ,Vh) has the 
"distribution" GW*. 

Otherwise said, the "law" under GW* of the distance \v\ ol the pointed vertex v to the base is the 
counting measure on N, and conditionally on \v\ — h, the subtrees T" and Ty are independent. More 
precisely, Ty is a usual, non-planted Galton- Watson tree with offspring distribution v, and T'" has 
the law GW^ 

It is then easy to see that for every h > 1, re-planting the tree T^^ at the vertex Vh leaves its 
distribution GW^ invariant, the pointed vertex Vh in the re-planted tree being the base vertex of T^^ 
(this tree is in our notation). Indeed, the offspring of the vertices along the branch (i>o, I'l, . . . , Vh) 
are the same in both trees, while the subtrees pending from the different offspring oivi, . . . , Vh-i are 
left unchanged. We deduce that {T^^,T') and {72''' ,T') have same "distribution", recalling that h 
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is not a random variable, but rather chosen according to the counting measure on N. This entails 
Proposition 2. □ 

We now turn our attention to n^n,£,n{t)-i the number of edges in the component containing the 
randomly picked edge ^„ in the forest Tn{t). Recall that the latter results from deleting every edge 
with probabihty 1 — exp(— i/y^), independently of the other edges, in the planted Galton- Watson 
tree Tn conditioned to have n edges. In this direction, it is convenient to introduce the following 
notation. If T is a rooted tree and /c > 0, we write Zk{T) for the number of vertices at generation 
k > in T, that is, at distance k from the root. If the tree is planted, then the definition of Zk{T) 
is similar, but counting only the vertices distinct from the base. 



CoroIIciry 2 In the preceding notation, we have 

oo 



k=Q 



Proof: Observe first that for every vertex u e V{7^), the edge pointing down from u to the 
base counts in the enumeration of n^n,€,n{^) edge on the path from to ^„ has 

been removed at time t. Conditionally given Tn,Cn, a-nd for a given vertex u e V{Tn,), this happens 
with probability exp{—{d{u, v{^n)) + l)Vv^)' where d{u, v{^n)) is the graph distance between u and 
v{^n), the extremity of ^„ which is the farther away from the base. We thus have 

where the second identity follows from the fact that given Tn, v{^n) has the uniform law in V{Tn)- 

Next, notice that the set of pointed trees (T, v) with exactly n edges has GW*-measure equal to 
nP(|T| = n), a quantity which is strictly positive and finite by hypothesis. So the conditional law 
GW*(- I 1^(7") I = n) on the space of pointed trees with n edges is well-defined, and corresponds to 
the distribution of {Tn, rj) where given Tn, ?7 is a uniformly chosen vertex in V{Tn)- 

Combining these observations, we deduce that 



E(n/in,€„(t)) = GW, ( J2 e-('^("'"W/^ \V{T)\ 

.ueV{T) 



n 



By definition, the number of vertices u e V{T) at distance k from the pointed vertex v equals 
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Zk-i{T^) + Zf;{Ty), where we adopt the convention Z_i{T) — 0. Therefore, 

ueV{T) k>0 

< ^e-('=+i)*/^(Zfc(f^) + Z,(T,)). 

fe>0 

We conclude that 

nniin,uit)) < 5]e-(^+^)*/^GW, (Z,(f + Z,(T,) | \V{T)\ = n) , 

fe>0 

while we have from Proposition 2 

GW. (Zfc(f ^) + Zfc(n) I \V{T)\ = n) = GW, (Zfc(T^) + Zk{T,) \ \V{T)\ = n) 

= GW4Zk{T)\\V{T)\=n) 

= E(Zfe(r„)), 

which completes the proof. □ 

Lemma 3 now follows readily from the following result by Janson; see Theorem 1.13 in [13]: There 
exists some constant C depending only on the offspring distribution i/, such that 

supE{Zk{Tn))<C{k + l) 

n>l 

for every k >0. Indeed, we derive from Corollary 2 that for every n >2, 

E(/^n^ (t)) <-y{k + l)e-^'+'WV^ = Cexp(-t/v/^ 
vpn,4„v JJ - ^ Z^y ! n(l - exp(-Vv^))2 



ifc>0 
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